Notes
Blast only appears to match at bp 1-300
Essentially sptlc1 with high homology is a combination of
HS110949
AM857503
FQ665912
and maybe a touch of CU989992
Several 454 sequences align with high homology region (found using Blast)
gnl|BL_ORD_ID|347250 FU6OSJA02ILTNQ 922 0E00
gnl|BL_ORD_ID|406924 FU6OSJA02HLXZF 916 0E00
gnl|BL_ORD_ID|301805 FU6OSJA02JII5Q 865 0E00
gnl|BL_ORD_ID|432015 FU6OSJA02F8582 585 1E-165
gnl|BL_ORD_ID|369050 FU6OSJA02JMQRM 565 1E-159
gnl|BL_ORD_ID|603802 FV2TRRU02GIMHG 272 3E-71
Starting over from scratch.
Problem: Blasting this consensus only gets significant protein hit to bp 629.
Also very concerned that top hits are all mammals.
Consensus
CCACGCGTCCGATAAAGTCCGCNCTTTTTCTGGAAGAAAGTTTGTCATTTGGCATTCTTGGTAGCAATGGGAAAGGTGTAACAGAACACTACAATATCTCTCCAGATGATATTGACTTGATTGCGGCCTCATTAGAAAATGCTATTGGATCAACAGGAGGCTTTTGCTGTGGGAAGAAATATATTGTGGACCATCAACGATTGTCAGGACTTGGATATTGCTTCTCAGCATCTTTACCTCCCATGTTAGCAACTGCAGCTATCGAGTCCCTGCGTTTGATTGATGAAAAACCAGGAATGCTTGTTGAATTGCGAGAAAACTGTGAAAAAATTCACAGCAAGCTGAGCGATATAAATGGAACCGTCATTGTAGGGGAACCTATTTCCCCAGTCAAGCACATTAGACTTGCAGAGCCAAGTACTGACAGGGACTTTGATGTGCAGACTCTGCAGAAGATAGCAGATCTCTCGAGAGACAACAAAGTTGCTGTGACGTTGGCTCGCTACTTAGAAGAGGAGGAACATAAACTTCCATTGCCAAGCATCCGGATATCTGTGAACAACCAGCTTTCAGATGAAGAAATTGACACTGTCTTCACTACACTAAGTGAAGCTTTTCAGAAAATCATCACTCATTAATGTGAAGTGCGAACAAATATCTATTGTCAGAAATATTAGGGAACAGTATTTATTTGATACTTTCTTATTGTAAAGGGTAATTGCTACACAAATCATCTTGTCTTAGACTAGGTCTTGTACAAGTATGTATATCTTAGATAATTTTCTGTATAAGGTGTTTGACATTACACATTTTCTTTTGGCCATAAGAACATTTAAAGTAAATGCATGTGTTTAAATTTGTAATTTTTTTAACAAATTGTAATGTCAAATGCCGCAAATATCGGTAGAAATGTGTCAATACACTATTAACAAGTTAAGTCTTCATAGGGCATTTATCTTGTATGTTCCAAAGGTGAGAGTTTTGTCTGCTGAGATTTGTTCACTTGAATGTTAATTTAGGATCAATCATTGTGTAAGAATTTTTTGATTTTGGTGAATTTTTATTAGAATGTTACTTGTCATCTATTGCTTTTTTCTACAAAAGTTTGTTGCTTTGAATAATATTTTGACAGACGTTG
ESSENTIAL LINK
From this we see high level of conservation
C. elegas fasta
>gi|71982616:80-1456 Caenorhabditis elegans Serine Palmitoyl Transferase famiLy family member (sptl-1) (sptl-1) mRNA, complete cds ATGGGATTTCTACCAGATTCGTGGCATTTCTACATTGAAACTTTGCTCGTTGCACTTCTTGCCTATGTTG TCATGCGGAACAGATCCAAACGTCAACAAGAAAAACTTTCAAAGAAACTAACTGAAAGGCAGAAAGATGA ATTAATTGCTGATTGGACTCCGGAACCATTGGTTCCTGAAACACCACAAGACCATCCTGTACTGAATCCA AAATATGCTGATGGAAAAATGACAAAGGATGTTTCGATCGATGGCGAAAAGTATCTCAACATGGCATCAA CAAATTTCCTCAGCTTTATCGGAGTCAAACGGATTGAAGACCGTGCGAAACAGACGATTTTCAAGTACGG CGTAGGATCGTGCGGGCCACGTGGATTCTACGGAACTGTTGATGTTCATTTGGACCTTGAAAAAGAATTG GCAAAATTTATGGGATGTGAGGAGGCTGTTCTGTACAGCTATGGGTTTGCTACAGTATCTTCAGCAATTC CCGCATACGCTAAAAAGGGAGATGTCATCTTTGTTGACGAAGGTGTTAACTTTGCAATCCAAAAAGGTCT TCAAGCATCACGAAGTCGTGTTGAATATTTCAAGCATAACGACATGGAGCACTTGGAGAGGTTATTACTG GAACAAGAACAAAGAGACAAGAAAGACCCGAAAAAGGCCAAGTCTGTACGGCGATTCATTGTTGTTGAAG GCCTCTATGTAAATTATGCGGATCTTTGCCCACTTCCCAAGATTATCGAGTTCAAATGGCGATTCAAAGT TCGTGTTTTCATTGACGAAAGCTGGTCATTTGGAGTCATTGGAAAAACCGGAAGAGGAGTCACCGAGCAC TTCAACGTTCCGATGGAGGACGTTGATATGGTAATGGCCTCACTCGAAAACGCATTGGCGTCAACTGGAG GATTCTGTGTTGGAAGATCATATGTAGTTGGGCATCAGCGGCTTTCAGGACTCGGATATTGCTTTTCTGC TTCTCTTCCACCTCTTCTTGCAACGGCCGCGTCTGAAGCTATTTCTATCATTGATGAAGAACCAAGTAGA GTTCAAAAAGTTACTGAAATGGCTATAAATGGTCAGAAAAAGCTACAAGATGCGCTGAGTGGATCGAAAT TTTCATTGCAAGGATGTCCAGAAAGTCCAATGAAGCATATCTATTACAATGGGGAAGATGAAGAAAAGCA ATTGGATACATTTGTGGAGACAGTCTTCACAAAAAATCATCTCCTACTGACCAGAGCTCGATACCTCGAC AAGGACGAATTGTTCAAAATTCGGCCAAGCATTCGAGTAATGTTCCAACACGACTTAACTGAAGAGGAGA TTCAAAGAGCTGTCGATGCTATCCGAGTTGTTGCTCATAAATTCTAA
megablast on Crassostrea nt database (no hits)
blastn on Crassostrea nt database (insig hits)
blastn on Crassostrea est database (below)
tblastx on Crassostrea est database
FQ6659512 nice match
>gi|318050034|gb|FQ665912.1|FQ665912 FQ665912 Crassostrea gigas library (Genoscope - CEA) Crassostrea gigas cDNA clone WY0AAA70YA15FM1, mRNA sequence GGACGCCATGGCGTCGACGTTCATTCCTCAAACATGGGAAATGTATGATATGTTTCAAGCCCTGTTGCAG GCTCCCTCATATCATCTTTTCTTCGAAGCATTGCTTATCATATGGATTTTTAAGCTGTTGTTTTTCTCTA AAGCCTACGCCCCAGAATCCGTCCTAACAGAAAAGGAAAAGGAGGAGTTGATTGCAGAATGGCAGCCGGA ACCTTTAGCTCCCGAAATTCCAGAGGACCACCCTGTTTTAATGGCGATGGAAAATAATATTATTACAGGG AAACCAGGAAAATATGTAACCATTAATGGAAAATCTTGTGTTAACATGGCTACATTAAACTTTCTTGGCA TGGCAGGGAACCCATCTGCGGAGGTAGAGGCCATCAAAACTCTGAAAAAGTATGGAGTGGGATCATGTGG ACCCAGAGGTTTCTATGGCACAATGGACGTCCATTTAGAACTTGAAGACAAAATAGCAAAATTCATGAAT TGTGAGGAAGCTATATTATATGCTTTTGGCTTTGCGACCATAGCGAGTGCTATCCCAGCTTACTCGAAAC GTGGAGATGTAATATTTGCTGATGAGGGAGTATGCTTTGCTATACAGAAAGGACTCGTTGCCTCAAGAAG CAAAATAAAGTGGTTCAAACACAATGATATGGAGGATCTGGAGCGTCTACTTATTGAACAAGCAAAGGAG GACAAGAAAAACCCTAAGAAAGCCAAAGTGACCAGAAGATTTCTTGTTGTGGAAGGACTCTACATTAACT ATGGTGACTTATGTCCACTTCCAAAATTAGTTGAACTCAAGTGGAAGTAT
megablast FQ6659512 Crassostrea ESTs (no other hits)
blastn against 454 trimmed reads
hits
FU6OSJA02JII5Q
FU6OSJA02F8582
Assembled
Consensus
>FQ665912plusFU6OSJA02JII5Q
GGACGCCATGGCGTCGACGTTCATTCCTCAAACATGGGAAATGTATGATATGTTTCAAGCCCTGTTGCAGGCTCCCTCATATCATCTTTTCTTCGAAGCATTGCTTATCATATGGATTTTTAAGCTGTTGTTTTTCTCTAAAGCCTACGCCCCAGAATCCGTCCTAACAGAAAAGGAAAAGGAGGAGTTGATTGCAGAATGGCAGCCGGAACCTTTAGCTCCCGAAATTCCAGAGGACCACCCTGTTTTAATGGCGATGGAAAATAATATTATTACAGGGAAACCAGGAAAATATGTAACCATTAATGGAAAATCTTGTGTTAACATGGCTACATTAAACTTTCTTGGCATGGCAGGGAACCCATCTGCGGAGGTAGAGGCCATCAAAACTCTGAAAAAGTATGGAGTGGGATCATGTGGACCCAGAGGTTTCTATGGCACAATGGACGTCCATTTAGAACTTGAAGACAAAATAGCAAAATTCATGAATTGTGAGGAAGCTATATTATATGCTTTTGGCTTTGCGACCATAGCGAGTGCTATCCCAGCTTACTCGAAACGTGGAGATGTAATATTTGCTGATGAGGGAGTATGCTTTGCTATACAGAAAGGACTCGTTGCCTCAAGAAGCAAAATAAAGTGGTTCAAACACAACGATATGGAGGATCTGGAGCGTCTACTTATTGAACAAGCAAAGGAGGACAAGAAAAACCCTAAGAAAGCCAAAGTGACCAGAAGATTTCTTGTTGTGGAAGGACTCTACATTAACTATGGTGACTTATGTCCACTTCCAAAATTAGTTGAACTCAAGTGGAAGTATAAAGTCCGCCTTTTTCTGGAAGAAAGTTTGTCATTTGGCATTCTTGGTAGCAATGGGAAAGGTGTAACAGAACACTACAATATCTCTCCAGATGATATTGACTTGATTGCGGCCTCATCAGAAAATGCTATTGGATCAACAGGAGGCTTTTGCTGTGGGAAGAAATATATTGTGGACCATCAACGATTGTCAGGACTTGGATATTGCTTCTCAGCATCTTTACCTCCCATGTTAGCAACTGCAGCTATCGAGTCCCTGCGTTTGATTGATGAAAACCAGGAATGCTTGTT
Blast new consensus on 454 data
Hits include
FU6OSJA02JII5Q (duh)
FU6OSJA02F8582
FU6OSJA02ILTNQ
FU6OSJA02HLXZF
new consensus
>FQ665912plusFU6OSJA02JII5Q_FU6OSJA02HLXZF
GGACGCCATGGCGTCGACGTTCATTCCTCAAACATGGGAAATGTATGATATGTTTCAAGCCCTGTTGCAGGCTCCCTCATATCATCTTTTCTTCGAAGCATTGCTTATCATATGGATTTTTAAGCTGTTGTTTTTCTCTAAAGCCTACGCCCCAGAATCCGTCCTAACAGAAAAGGAAAAGGAGGAGTTGATTGCAGAATGGCAGCCGGAACCTTTAGCTCCCGAAATTCCAGAGGACCACCCTGTTTTAATGGCGATGGAAAATAATATTATTACAGGGAAACCAGGAAAATATGTAACCATTAATGGAAAATCTTGTGTTAACATGGCTACATTAAACTTTCTTGGCATGGCAGGGAACCCATCTGCGGAGGTAGAGGCCATCAAAACTCTGAAAAAGTATGGAGTGGGATCATGTGGACCCAGAGGTTTCTATGGCACAATGGACGTCCATTTAGAACTTGAAGACAAAATAGCAAAATTCATGAATTGTGAGGAAGCTATATTATATGCTTTTGGCTTTGCGACCATAGCGAGTGCTATCCCAGCTTACTCGAAACGTGGAGATGTAATATTTGCTGATGAGGGAGTATGCTTTGCTATACAGAAAGGACTCGTTGCCTCAAGAAGCAAAATAAAGTGGTTCAAACACAACGATATGGAGGATCTGGAGCGTCTACTTATTGAACAAGCAAAGGAGGACAAGAAAAACCCTAAGAAAGCCAAAGTGACCAGAAGATTTCTTGTTGTGGAAGGACTCTACATTAACTATGGTGACTTATGTCCACTTCCAAAATTAGTTGAACTCAAGTGGAAGTATAAAGTCCGCCTTTTTCTGGAAGAAAGTTTGTCATTTGGCATTCTTGGTAGCAATGGGAAAGGTGTAACAGAACACTACAATATCTCTCCAGATGATATTGACTTGATTGCGGCCTCATTAGAAAATGCTATTGGATCAACAGGAGGCTTTTGCTGTGGGAAGAAATATATTGTGGACCATCAACGATTGTCAGGACTTGGATATTGCTTCTCAGCATCTTTACCTCCCATGTTAGCAACTGCAGCTATCGAGTCCCTGCGTTTGATTGATGAAAAACCAGGAATGCTTGTTGAATTGCGAGAAAATTGTGAAAAAATTCACAGCAAGCTGAGCGATATAAATGGAACCGTCATTGTAGGGGAACCTATTTCCCCAGTCAAACACATTAGACTTGCAGAGCCAAGTACTGACAGGGACTTTGATGTGCAGACTCTGCAGAAGATAGCAGATCTCTCGAGAGACAACAAAGTTGCTGTGACGTTGGCTCGCTACTTAGAAGAGGAGGAACATAAACTGCCATTGCCAAGCAT
still good match
note - full protein about 473
gnl|BL_ORD_ID|347250 FU6OSJA02ILTNQ 946 0E00
gnl|BL_ORD_ID|301805 FU6OSJA02JII5Q 944 0E00
gnl|BL_ORD_ID|406924 FU6OSJA02HLXZF 940 0E00
gnl|BL_ORD_ID|369050 FU6OSJA02JMQRM 573 1E-161
gnl|BL_ORD_ID|432015 FU6OSJA02F8582 548 6E-154
gnl|BL_ORD_ID|603802 FV2TRRU02GIMHG 280 2E-73
gnl|BL_ORD_ID|410751 FU6OSJA02JOE9S 123 3E-26
new consensus
>>FQ665912plusFU6OSJA02JII5Q_FU6OSJA02HLXZF_FU6OSJA02JMQRM
GGACGCCATGGCGTCGACGTTCATTCCTCAAACATGGGAAATGTATGATATGTTTCAAGCCCTGTTGCAGGCTCCCTCATATCATCTTTTCTTCGAAGCATTGCTTATCATATGGATTTTTAAGCTGTTGTTTTTCTCTAAAGCCTACGCCCCAGAATCCGTCCTAACAGAAAAGGAAAAGGAGGAGTTGATTGCAGAATGGCAGCCGGAACCTTTAGCTCCCGAAATTCCAGAGGACCACCCTGTTTTAATGGCGATGGAAAATAATATTATTACAGGGAAACCAGGAAAATATGTAACCATTAATGGAAAATCTTGTGTTAACATGGCTACATTAAACTTTCTTGGCATGGCAGGGAACCCATCTGCGGAGGTAGAGGCCATCAAAACTCTGAAAAAGTATGGAGTGGGATCATGTGGACCCAGAGGTTTCTATGGCACAATGGACGTCCATTTAGAACTTGAAGACAAAATAGCAAAATTCATGAATTGTGAGGAAGCTATATTATATGCTTTTGGCTTTGCGACCATAGCGAGTGCTATCCCAGCTTACTCGAAACGTGGAGATGTAATATTTGCTGATGAGGGAGTATGCTTTGCTATACAGAAAGGACTCGTTGCCTCAAGAAGCAAAATAAAGTGGTTCAAACACAACGATATGGAGGATCTGGAGCGTCTACTTATTGAACAAGCAAAGGAGGACAAGAAAAACCCTAAGAAAGCCAAAGTGACCAGAAGATTTCTTGTTGTGGAAGGACTCTACATTAACTATGGTGACTTATGTCCACTTCCAAAATTAGTTGAACTCAAGTGGAAGTATAAAGTCCGCCTTTTTCTGGAAGAAAGTTTGTCATTTGGCATTCTTGGTAGCAATGGGAAAGGTGTAACAGAACACTACAATATCTCTCCAGATGATATTGACTTGATTGCGGCCTCATTAGAAAATGCTATTGGATCAACAGGAGGCTTTTGCTGTGGGAAGAAATATATTGTGGACCATCAACGATTGTCAGGACTTGGATATTGCTTCTCAGCATCTTTACCTCCCATGTTAGCAACTGCAGCTATCGAGTCCCTGCGTTTGATTGATGAAAAACCAGGAATGCTTGTTGAATTGCGAGAAAATTGTGAAAAAATTCACAGCAAGCTGAGCGATATAAATGGAACCGTCATTGTAGGGGAACCTATTTCCCCAGTCAAGCACATTAGACTTGCAGAGCCAAGTACTGACAGGGACTTTGATGTGCAGACTCTGCAGAAGATAGCAGATCTCTCGAGAGACAACAAAGTTGCTGTGACGTTGGCTCGCTACTTAGAAGAGGAGGAACATAAACTTCCATTGCCAAGCATCCGGATATCTGTGAACAACCAGCTTTCAGATGAAGAAATTGACACTGTCTTCACTACACTAAGTGAAGCTTTTCAGAAAATCATCACTCATTAATGTGAAGTGCGAACAAATATACTATTGTCAGAAATATTGGGGAACAGTATTTAATTTGAATACTTTC